PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Pavir.7KG109200.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Panicinae; Panicum
Family HD-ZIP
Protein Properties Length: 716aa    MW: 78871.1 Da    PI: 5.7266
Description HD-ZIP family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Pavir.7KG109200.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  60.3   3e-19    73     124  5          56
                          SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           +ft+eq+  Le+l++++++p+ ++r++LA+++gL+ rq+k+WFqNrR k k
  Pavir.7KG109200.1.p  73 KRFTAEQILGLESLYQRCPHPDDSTRKDLAARIGLDARQIKFWFQNRRNKVK 124
                          68***********************************************987 PP

2    START     96.3   5.7e-31  228    451  3          205
                          HHHHHHHHHHHHHHC-TT-EEEE..EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
                START   3 aeeaaqelvkkalaeepgWvkss..esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                           e+a++e+v +a+ +ep+W      +++n  e+     +  +        + +e  + +++v +++ +lv  l d+  +W+++++     
  Pavir.7KG109200.1.p 228 TERALYEFVMLASKGEPMWLPATngKILNDLEYKDHTFP--GllgpcpqgFVMEGTKGTTLVRGNAFDLVGLLSDVT-RWSKMFPgiiqG 314
                          578899999999999999999995544444444444333..145788777889999999***********9999999.*******99954 PP

                          EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEEC CS
                START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepks 158
                          +   + +s g      g +q m+ +l +  p +p R   f+R++++ +  +w++vdvS d  +  +      +++ ++llpSg+l++++s
  Pavir.7KG109200.1.p 315 VRASNIVSGGsftsldGLIQKMNVDLWVQAPRAPnRSLKFLRFSKRIENNQWAVVDVSMDGIRGIEPdgrRIGYMSCRLLPSGCLLQDMS 404
                          4444445555999***********************************************988776667889****************** PP

                          TCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
                START 159 nghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                          ng +kvtw+ h+++++++++ l+r++ +sg+a ga++w+a lqr ce
  Pavir.7KG109200.1.p 405 NGLCKVTWIVHAEYDEASVPPLFRQFFQSGTALGASRWLASLQRRCE 451
                          ********************************************998 PP
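
The alignment blocks above can also be summarized programmatically. The following is a minimal sketch, not part of the PlantTFDB record, that tallies the columns of a match line (the unlabeled line between the profile consensus and the protein sequence). It assumes the HMMER3-style convention in which a letter marks a residue identical to the profile consensus, `+` marks a positively scoring substitution, and a space marks any other column. The function name `summarize_midline` is hypothetical.

```python
from collections import Counter

def summarize_midline(midline: str) -> Counter:
    """Count column types in an alignment match line.

    Assumed convention (HMMER3-style): a letter marks a residue identical
    to the profile consensus, '+' a positively scoring substitution, and a
    space any other column.
    """
    counts = Counter()
    for ch in midline:
        if ch.isalpha():
            counts["identical"] += 1
        elif ch == "+":
            counts["positive"] += 1
        else:
            counts["other"] += 1
    return counts

# Match line of the Homeobox alignment (residues 73-124), copied from the
# record. The leading space is the first alignment column, so it is kept.
homeobox_midline = " +ft+eq+  Le+l++++++p+ ++r++LA+++gL+ rq+k+WFqNrR k k"

summary = summarize_midline(homeobox_midline)
print(summary["identical"], summary["positive"], summary["other"])
```

The three counts add up to the number of aligned columns, so dividing the identical count by that total gives a rough identity to the consensus. Whitespace in a copied match line is significant: losing a space shifts every later column.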

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  1.5E-19   53     124  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.59E-17  57     124  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.86     66     126  IPR001356    Homeobox domain
SMART            SM00389           1.2E-18   68     130  IPR001356    Homeobox domain
CDD              cd00086           1.60E-17  69     124  No hit       No description
Pfam             PF00046           7.7E-17   70     124  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         101    124  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           31.023    217    455  IPR002913    START domain
SuperFamily      SSF55961          5.04E-18  223    451  No hit       No description
CDD              cd08875           3.41E-72  224    451  No hit       No description
SMART            SM00234           9.9E-14   226    452  IPR002913    START domain
Pfam             PF01852           3.3E-24   229    451  IPR002913    START domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 716 aa
MENNWPLNNN DGQGNNSLNP GNETGHLNWL NSLEEYDMDA LMGADDHVNN DQDEDDEEHN  60
LQGAESSSKR SGKRFTAEQI LGLESLYQRC PHPDDSTRKD LAARIGLDAR QIKFWFQNRR  120
NKVKVKAVGD ENKGIQQENA QLQVENMELQ QKLLCGSCRD PNEKWHLLNE NAKLKDTKRR  180
AQDYLIKLIH VTKVPHSETL EHLESAALNL VPFTDDCSTD QDTLVSYTER ALYEFVMLAS  240
KGEPMWLPAT NGKILNDLEY KDHTFPGLLG PCPQGFVMEG TKGTTLVRGN AFDLVGLLSD  300
VTRWSKMFPG IIQGVRASNI VSGGSFTSLD GLIQKMNVDL WVQAPRAPNR SLKFLRFSKR  360
IENNQWAVVD VSMDGIRGIE PDGRRIGYMS CRLLPSGCLL QDMSNGLCKV TWIVHAEYDE  420
ASVPPLFRQF FQSGTALGAS RWLASLQRRC EYMAIMHSSH SSGKNSVLEL SQRMMVSFYT  480
AVSKPVAPLD PSNMSTSGVA TVRMVIWNYS TMGQPSTLVL SATTTVWLPG TPPQRIHEYL  540
CDGQRRGEWD RFAYDGPVQE LSSIVTCRQL PGNVVSVLHP NDVLHQMNSN MLILQEATSD  600
LSCSLLVYSL IEKNMMRAVM DGGDNTTAFL LPSGFAILPD GYVSYAAAAG GASSSNVPNT  660
SQNGSAGSLL TAAYQALLSS SADHAAWTMD DAGNRICHAI SKILAAVGAD IAIPA*
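
For readers who want to work with this record programmatically, the following is a minimal sketch, not part of PlantTFDB, of turning the numbered sequence block above into a plain string and slicing a region by 1-based inclusive coordinates, such as the Start and End columns of the Signature Domain table. The function names `parse_sequence_block` and `slice_region` are hypothetical. Treating the trailing `*` as a stop symbol rather than a residue is an assumption.

```python
import re

def parse_sequence_block(text: str) -> str:
    """Join a numbered, space-grouped sequence block into one string.

    Drops whitespace, the running position counters at the line ends, and
    a trailing '*' (assumed here to mark the stop, not a residue).
    """
    letters = re.sub(r"[^A-Za-z*]", "", text)
    return letters.rstrip("*").upper()

def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if not (1 <= start <= end <= len(seq)):
        raise ValueError(f"region {start}-{end} outside 1..{len(seq)}")
    return seq[start - 1:end]

# First two lines of the protein sequence in this record.
block = """
MENNWPLNNN DGQGNNSLNP GNETGHLNWL NSLEEYDMDA LMGADDHVNN DQDEDDEEHN  60
LQGAESSSKR SGKRFTAEQI LGLESLYQRC PHPDDSTRKD LAARIGLDAR QIKFWFQNRR  120
"""
seq = parse_sequence_block(block)
print(len(seq))                    # 120
print(slice_region(seq, 73, 82))   # KRFTAEQILG
```

The ten residues returned for positions 73 to 82 match the start of the Homeobox alignment, which begins at residue 73 with KRFTAEQILG.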
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    173    179  LKDTKRR
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002446115.1  0.0      hypothetical protein SORBIDRAFT_06g001940
TrEMBL  K3YEP1          0.0      K3YEP1_SETIT; Uncharacterized protein
STRING  Si012707m       0.0      (Setaria italica)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MonocotsOGMP83241742
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G04890.1  1e-116   protodermal factor 2